Policy Learning with Hypothesis based Local Action Selection

نویسندگان

Bharath Sankaran

Jeannette Bohg

Nathan D. Ratliff

Stefan Schaal

چکیده

For robots to be effective in human environments, they should be capable of successful task execution in unstructured environments. Of these, many task oriented manipulation behaviors executed by robots rely on model based grasping strategies and model based strategies require accurate object detection and pose estimation. Both these tasks are hard in human environment, since human environments are plagued by partial observability and unknown objects. Given these constraints, it becomes crucial for a robot to be able to operate effectively under partial observability in unrecognized environments. Manipulation in such environments is also particularly hard, since the robot needs to reason about the dynamics of how various objects of unknown or only partially known shape interact with each other under contact. Modelling the dynamic process of a cluttered scene during manipulation is hard even if all object models and poses were known. It becomes even harder to reasonably develop a process or observation model, with only partial information about the object class or shape. To enable a robot to effectively operate in partially observable unknown environments we introduce a policy learning framework where action selection is cast as a probabilistic classification problem on hypothesis sets generated from observations of the environment. Online the action classifier is operated with a global stopping criterion for successful task completion. The example we consider is object search in clutter, where we assume having access to a visual object detector, that directly populates the hypothesis set given the current observation. Thereby we can avoid the temporal modelling of the process of searching through clutter. We demonstrate our algorithm on two manipulation based object search scenarios; a modified minesweeper simulation and a real world object search in clutter using a dual arm manipulation platform.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Differential Evolution and Spatial Distribution based Local Search for Training Fuzzy Wavelet Neural Network

Abstract Many parameter-tuning algorithms have been proposed for training Fuzzy Wavelet Neural Networks (FWNNs). Absence of appropriate structure, convergence to local optima and low speed in learning algorithms are deficiencies of FWNNs in previous studies. In this paper, a Memetic Algorithm (MA) is introduced to train FWNN for addressing aforementioned learning lacks. Differential Evolution...

متن کامل

The Impact of Studio-based learning on Metacognition and Design Ability of Architecture Students - Action Research

Proper training can put design learners in the right direction. It also enhances the power of drawing. Objective of this study was the effectiveness of architectural studio-based learning on increasing drawing power and metacognition abilities of students. This research seeks to answer these questions: Can architectural studio-based learning increase student design ability? Can architectural st...

متن کامل

Guided exploration in gradient based policy search with Gaussian processes

Applying reinforcement learning(RL) algorithms in robotic control proves to be challenging even in simple settings with a small number of states and actions. Value function based RL algorithms require the discretization of the state and action space, a limitation that is not acceptable in robotic control. The necessity to be able to deal with continuous state-action spaces led to the use of dif...

متن کامل

Novel Radial Basis Function Neural Networks based on Probabilistic Evolutionary and Gaussian Mixture Model for Satellites Optimum Selection

In this study, two novel learning algorithms have been applied on Radial Basis Function Neural Network (RBFNN) to approximate the functions with high non-linear order. The Probabilistic Evolutionary (PE) and Gaussian Mixture Model (GMM) techniques are proposed to significantly minimize the error functions. The main idea is concerning the various strategies to optimize the procedure of Gradient ...

متن کامل

Development of a visual Basic 6.0 based smart application for the design and selection of local exhaust ventilation systems

Introduction: Air pollution in industrial work environments has adverse effects on worker health, for example, chronic obstructive pulmonary disease and asthma. These diseases impose direct and indirect costs on society. In hierarchy controls, local exhaust ventilation is considered an "engineering control" to remove or control contaminants released in indoor work environments. It is one of the...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1503.06375 شماره

صفحات -

تاریخ انتشار 2015

Policy Learning with Hypothesis based Local Action Selection

نویسندگان

چکیده

منابع مشابه

A Differential Evolution and Spatial Distribution based Local Search for Training Fuzzy Wavelet Neural Network

The Impact of Studio-based learning on Metacognition and Design Ability of Architecture Students - Action Research

Guided exploration in gradient based policy search with Gaussian processes

Novel Radial Basis Function Neural Networks based on Probabilistic Evolutionary and Gaussian Mixture Model for Satellites Optimum Selection

Development of a visual Basic 6.0 based smart application for the design and selection of local exhaust ventilation systems

عنوان ژورنال:

اشتراک گذاری